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FACTORING  PROBABILITIES  ON  COMPACT  GROUPS 


by 

Persi  Diaconis  and  Mehrdad  Shashahani 
Department  of  Statistics*  Stanford  University 


Abstract 

When  can  a  probability  P  be  factored  as  This  problem  arises 

in  efficient  generation  of  pseudo  random  integers  and  permutations.  It  is 
thus  natural  to  think  of  P  defined  on  a  group.  We  show  that  any  strictly 
positive  measure  can  be  factored.  The  uniform  distribution  can  be  factored 
in  a  non-trivial  way  for  any  compact  group  having  more  than  three  elements. 

If  it  is  required  that  U  =  P  *  P  ,  then  factorization  is  possible  if  and  only 
if  the  group  is  not  Abelian  or  the  product  of  the  quart ernions  and  a  finite 
number  of  two  element  groups. 
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1*  Introduction 


Let  P  be  a  probability  measure  on  the  Borel  sets  of  a  locally  compact 
topological  group.  Following  Levy  (1953),  we  say  that  P  is  decomposable 
if  P  can  be  written  as  a  convolution  P  *  P^ *  P2  with  p  not  fixed  at 
a  point.  The  study  of  decomposable  probability  arises  naturally  in  applied 
problems . 

These  problems  include  computer  generation  of  pseudo-random  uniform  var¬ 
iables  on  a  finite  group  such  as  the  integers  (mod  n)  or  the  permutations. 
Application  is  also  made  to  the  study  of  the  distribution  of  nonpar ametric 
measures  of  correlation.  Applications  are  discussed  in  Section  2.  In  Section 
3  we  show  that  any  positive  measure  on  a  compact  group  Is  decomposable. 

The  uniform  distribution  U  on  a  compact  group  requires  special  treat¬ 
ment  because  of  the  trivial  factorization  U  »  U*P  ,  for  any  P.  We  say  that 
U  Is  semi-decomposable  if  U  »  P^  *  with  not  uniform.  In  Section  3 

we  show  that  U  on  any  compact  polish  group  is  semi-decomposable  unless  the 
group  has  two  or  three  elements. 

If  restrictions  are  put  on  P^  ,  then  factorization  may  not  be  possible. 

For  example,  if  P(A)  =  P(A  ,  then  U  =  P*P  is  never  possible.  The  fac¬ 
torization  U  =  P*P  is  never  possible  on  Abelian  groups,  but  is  possible 
on  most  non-Abelian  groups.  Indeed,  it  is  shown  that  U*P*P  for  all  groups 
different  from  a  product  of  the  quartemions  and  a  finite  number  of  two  element 
groups.  We  also  study  the  possibility  of  factorization  when  P  is  restricted  to 
be  spherically  symmetric  with  respect  to  a  subgroup  forming  a  Gelfand  pair. 

A  motivated  Introduction  to  probability  on  groups  is  in  Grenander  (1963) . 
For  a  comprehensive  treatment  see  Heyer  (1977)  or  Hewitt  and  Ross  (1963), 

(1970).  A  nice  discussion  of  factorization  on  TR  is  in  Chapter  6  of  Lukas 
(1970). 
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2.  Some  Applications 


This  section  describes  some  applied  problems  in  which  factorization 
plays  a  role. 

Example  1.  Generating  random  permutations. 

For  generating  random  bridge  hands,  Monte  Carlo  investigation  of  rank 
tests  in  statistics,  and  other  applications,  a  source  of  pseudo-random  per¬ 
mutations  of  n  objects  is  useful.  If  n  is  small,  a  useful  approach  is  to 
set  up  a  1-1  correspondence  between  the  integers  from  1  to  n!  and  permu¬ 
tations  and  then  use  a  source  of  pseudo  random  integers.  The  factorial  num¬ 
ber  system  is  sometimes  used  for  this  purpose,  see  page  64  and  192  of  Knuth 
(1981).  For  larger  n  ,  like  52,  the  most  frequently  used  algorithm  in¬ 
volves  a  factorization  of  the  uniform  distribution.  Informally,  at  the  ith 
stage  a  random  integer  between  i  and  n  is  chosen  and  i  and 

are  transposed.  Call  the  probability  distribution  at  the  ith  stage  ,  it 

will  be  shown  below  that  ^*^2*  *  * # »  *  ?n-l  is  a  ^actor^zat *on  °f  the  uni¬ 
form  distribution.  Further  discussion  of  this  algorithm  is  on  pp.  139-141 
of  Knuth  (1981).  The  factorization  has  recently  been  applied  in  the  theore¬ 
tical  problem  of  finding  the  order  of  a  random  permutation  by  Bovey  (1980). 

It  also  forms  the  basis  of  fast  algorithms  for  manipulating  permutations. 

See  Furst  et  al.  (1980).  The  following  algorithm  abstracts  the  idea  to  any 
finite  group. 

Subgroup  algorithm.  Let  G  be  a  finite  group.  Let  Gq  *  G  ^  Gr 

be  a  nested  chain  of  subgroups,  not  necessarily  normal.  Let  C^  be  coset 
representatives  for  in  G^  ,  0  £  i  <  r.  Clearly  G  can  be  represen¬ 

ted  as  G  *  Cq  x  C^  x  ...  x  Cr  ^  x  in  the  sense  that  each  g  e  G  has  a 
unique  representation  as  ggg^  •••  gr_^gr  gi  c  and  e  Gr. 


Let  be  the  uniform  distribution  on  and  Gr  respectively.  The 

convolution  *  P2  *  *  *  •  *  pr  is  then  a  factorization  of  the  uniform  dis¬ 
tribution  on  G. 

Specializing  to  the  symmetric  group,  consider  the  chain 

S  ^>S  ,^>S  r>{id}.  Here  S  .  is  represented  as  the  set  of 

n  n-1  n-Z  n-i 

permutations  of  n  letters  that  fix  the  first  i  letters.  Then,  coset  re¬ 
presentatives  can  be  chosen  as  the  set  of  transpositions  transposing  i 

and  letters  larger  than  i.  The  subgroup  algorithm  suggests  a  class  of  algo¬ 
rithms  that  interpolate  between  the  factorial  number  system  and  random  trans¬ 
positions:  Let  S  S  ^  S  with  n  >  n-  >  ...  >  n  .  Here  the 

n  n-  n  i  r 

1  r 

size  of  the  cosets  are  allowed  to  get  large  and  a  variant  of  the  fac¬ 

torial  number  system  permits  choice  of  a  random  coset  element  from  a  random 
integer.  For  example,  consider  the  chain  D  sn«2  Sn-4  3  *  *  *  3  (i<0. 

Coset  representatives  for  sn_2(i+l)  in  Sn-2i  are  Permutations  bringing 
a  pair  of  elements  between  i+1  and  n  into  positions  i+1  and  i+2. 

These  permutations  may  be  ordered  lexographically,  setting  up  a  1-1  cor¬ 
respondence  between  them  and  numbers  1,  2,  ...,  (n-i) (n-i-1) .  An  advantage 
of  this  method  is  that  it  requires  fewer  calls  to  the  random  number  generator. 

The  subgroup  algorithm  can  be  used  to  generate  random  positions  in  the 
currently  popular  Rubick’s  cube  puzzle. 

A  different  application  of  the  factorization  suggested  by  the  subgroup 
algorithm  is  to  computer  generation  of  pseudo-random  integers  (mod  N) .  Given 
two  sources  of  pseudo-random  integers  X  and  Y  ,  computer  scientists  some¬ 
times  form  a  new  sequence  Z  *  X+Y  (mod  N).  Knuth  (1981,  p.  631)  contains 
a  discussion  of  work  by  Marsaglia  and  others.  Solomon  and  Brown  (1980)  show 
that  this  procedure  brings  Z  closer  to  uniform.  Marshall  and  Olkin  (1980, 
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p.  383)  generalize  this  to  any  finite  group.  It  is  natural  to  seek  distribu¬ 
tions  of  X  and  Y  such  that  Z  is  exactly  uniform.  The  subgroup  algorithm 
does  this  when  N  is  composite.  For  N  prime,  see  Lemma  2  in  Section  3. 

Example  2.  Nonparametric  measures  of  association. 

A  wide  variety  of  nonparametric  measures  of  association  arise  from  me¬ 
trics  p  on  the  permutation  group  S^.  Typical  choices  are  = 

2 

{S  (tt  *  ( i)  -  TT(n))  }2  (Spearman’s  rank  correlation),  or  the  minimum  number  of 
pairwise  adjacent  transpositions  it  takes  to  bring  tt  to  tt *  (Kendall’s  tau)  . 
For  a  survey,  see  Diaconis  and  Graham  (1977).  Most  naturally  occurring  metrics 
are  right  invariant: 

p(7r’n,  nr))  =  p(7T’,  tt)  =  p( id ,  tt'tT1)  . 

If  tt*  and  it  are  chosen  uniformly  from  ,  then  the  distribution  of 

p(TT,,7T)  is  the  same  as  the  distribution  of  p(id,U)  ,  where  U  is  a  random 
permutation.  It  is  natural  to  ask  if  tt’tt*”1  can  be  uniform  under  other 
assumptions  on  the  distribution  of  (tt,,tt).  For  example,  if  tt  9 ,  tt  are 
chosen  independently  from  the  same  non-uniform  distribution,  it  is  shown  in 
Theorem  3  of  Section  4  that  tt’tt"’1  is  not  uniform.  The  subgroup  algorithm, 
or  Theorem  5  in  Section  4,  show  that  tt’tt’*1  can  be  uniform  if  tt’  and  tt 
are  merely  independent . 

3 •  Decomposable  and  Semi-decomposable  Probabilities 
Decomposable  probabilities. 

Throughout,  P  is  a  probability  on  the  Bor el  sets  of  a  compact  topolo¬ 
gical  group.  The  probability  P  is  decomposable  if  P  can  be  decomposed 


as  a  convolution  ?  •  with  P^^  not  fixed  at  a  point.  This  definition 

rules  out  the  trivial  decomposition  P  *  P^  *  6^  with  6^  a  point  mass  at 

group  element  g  and  P^  *  P*6  Levy  (1953)  gave  a  nice  example  of  a 

g 

measure  which  is  not  decomposable:  Take  the  group  as  ,  fix  p  in  (0,1) 

and  let  P  put  mass  p  on  the  identity  and  mass  (1-p)  on  a  3-cycle.  It 
is  not  hard  to  show  that  the  set  S  of  support  points  of  P  is  not  of  the 
form  S  *  5^2  with  of  cardinality  2  or  more,  so  P  is  not  decompo¬ 

sable.  The  following  result  shows  that  if  P  «  dg  with  a  positive  continuous 
density,  then  P  is  decomposable. 

Theorem  1.  Let  G  be  compact;  let  P  »  fdg  with  f  >  e  >  0.  Then  P 
is  decomposable. 

Proof.  Suppose  f  >  e  >  0.  Let  probability  measures  P^  be  defined  by 

P1  *  l+e^  {f  +  e^Jdg  ,  P2  -  {Sid  +  e2dg}  , 

with  chosen  so  ei  +  e2  +  ele2  *  0  ;  e.g.,  e1  *  ^/(l+e^)  and 

chosen  positive  but  so  small  that  P^  >_  0.  Then 

P1*P2  =  fdg  +  ^el  +  £2  +  GlG2*d8  "  fdg  •  n 

Remarks.  In  the  case  of  finite  groups,  this  gives  an  easy  proof  of  a  theorem 
of  P.  J.  Cohen  (1959).  Cohen  showed  that  if  the  density  f  is  continuous, 
then  the  measures  P^  can  be  chosen  to  have  densities.  Note  that  in  our 
construction  P2  is  not  absolutely  continuous  with  respect  to  dg  when  G  is 
infinite.  Cohen  gives  an  example  of  a  probability  density  on  a  compact  subset 
of  TR  which  cannot  be  written  as  a  convolution  of  two  probabilities  with  den¬ 
sities.  An  earlier  example  of  Levy  and  a  review  of  the  literature  on  R  appear  in 
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chapter  6  of  Lukacs  (1970).  Lewis  (1967)  shows  that  the  uniform  distribution 
on  [0,1]  cannot  be  written  as  a  convolution  of  two  probabilities  with  den¬ 
sities.  It  is  well  known  that  the  convolution  of  singular  measures  can  have 
a  density.  See  Rubin  (1967)  and  Hewitt  and  Zukerman  (1966)  for  some  examples 

Semi-decomposable  probabilities. 

Turn  now  to  decomposing  the  uniform  distribution  U  on  a  compact  group. 
The  subgroup  algorithm  of  Section  2  gives  any  easy  method  for  decomposing 
the  uniform  distribution  on  a  finite  group.  Consideration  of  the  circle 
group  and  the  subgroup  of  kth  roots  of  unity  suggests  that  the  result  genera¬ 
lizes: 

Lemma  1.  Let  G  be  a  compact,  Polish  group  with  a  closed  subgroup  H. 
Then,  the  uniform  distribution  is  semi-decomposable. 

Proof .  Let  tt  :  G  G/H  be  the  cannonical  map.  Let  Q  be  the  image  of  the 
uniform  distribution  under  tt.  Take  a  measurable  inverse  <j>  ;  G/H  -*  G  with 
the  property  that  tt  4>{gH}  =  (gH).  The  existence  of  <j>  under  our  hypothesis 
follows  from  Theorem  1  in  Bondar  (1976) .  Let  P^  be  the  image  of  Q  under 
<}>.  Let  P2  be  the  uniform  distribution  on  H.  To  prove  that  *  ^2  is 
uniform,  consider  any  continuous  function  f  on  G.  By  definition 


|  f(g)  P1*P2(dg)  =  |g  |  f(8182)  P1Cdg1)P2(dg2)  = 

-L 


4>(G/H) 


{  f(gig2)  PiOfepPj 


f(g>  U(dg) 


The  final  equality  in  the  display  follows  from  Theorem  2  in  Bondar  (1976).  £ 

We  next  show  how  to  decompose  the  uniform  distribution  on  groups  with  no 
proper  subgroups:  the  integers  mod  a  prime.  It  is  easy  to  see  that  the 


<dg2) 
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One  approach 


uniform  distribution  is  not  semi-decomposable  on  Z ^  or  . 

2 

uses  the  fact  that  1+z  and  1  +  z+z  are  irreducible  over  the  reals 
factorization  of  U  leads  would  lead  to  a  factorization  of  the  associated 
polynomial • 

Lemma  2*  Let  p  >_  5  be  prime.  Let  Z^  be  the  integers  mod  p.  Then 
the  uniform  distribution  is  semi-decomposable. 

Proof.  For  i  «  1,  2,  ,  let  a^,  b^  be  determined  by 


a^+2b^  =  1  ,  a^+2b^  cos |  —  0 

Noting  that  cos^~~  “j  f  1  for  i  in  the  indicated  range; 

b±  “  |2(l  '  COS  ’  ai  =  'COS(1Ti_)/  {l  -  COS  H^)  • 


Define  signed  measures  Q,  on  Z  by 

x  P 


Qi(0)  =*  ai  ,  Qi(i)  -  Q1(-i)  =  b±  ,  Q±( j)  »  0  otherwise  . 

The  argument  depends  on  the  discrete  Fourier  transform  of  a  measure.  If  P 

is  a  measure  on  Z  and  k  e  Z  ,  define 
P  P 


P„(P)  -  i  f  PUle2"1^  . 


j-0 


For  the  uniform  distribution, 

Pk(U) 


If  k  -  0 
otherwise 
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r 

I 


i 

\ 


It  is  easy  to  check  that  for  k  ^  0,  P+^Q^)  *  p0^k^  “  Now  let 

signed  measures  R^  and  be  defined  by 


R 


1 


(p-d/2 

R?  =  *  Q,  for  fixed  1  £  a  £  (p-l)/2  . 

i-a+1 


Finally,  for  sufficiently  small  e  the  measures  U  +  eR^  and  U  +  eR^  are 

positive  measures  and  can  be  normed  to  be  probabilities,  say  and 

We  claim  U  =  P1*P2.  Pndeed>  for  k  4  0,  p^(P^  *  P^)  =  P^CP^)  P^*^  ” 

To  show  that  the  decomposition  is  non-trivial,  it  suffices  to  show  that  R^ 

are  non-zero,  i  =  1,  2.  This  follows  from  the  fact  that  for  k  4  j  , 

p.(Q,)  4  0.  Indeed, 

3  R 


Remarks#  Factoring  the  uniform  distribution  on  is  sufficiently  close  to 

some  classical  factorization  results  to  warrant  discussion#  A  well  known  ele¬ 
mentary  probability  problem  argues  that  it  is  impossible  to  load  two  dice  so 
that  the  sum  is  uniform.  More  generally,  Dudewicz  and  Dann  (1972)  show  that 
it  is  impossible  to  find  probabilities  and  P^  on  the  set  (l,  2,  . . . ,  n} 

such  that  Pi  *  P2  is  t*ie  uniform  distribution  on  {2,  2n} .  A  related 

result  asks  for  a  decomposition  of  the  uniform  distribution  on  the  set 
0,  1,  2,  . ..,  N.  Lukacs  (1970),  pp.  182-183,  reviews  the  literature  on  this 
problem#  He  shows  factorization  is  possible  when,  and  only  when,  N  is 
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prime.  The  difference  between  the  three  results  is  this:  In  Lemma  2,  and 
in  the  subgroup  factorization,  addition  is  (mod  N)  .  In  the  dice  result, 
both  factors  must  be  supported  on  {l,  . ..,  n)  while  the  uniform  distribution 
is  on  {2,  . n}.  In  the  results  reported  in  Lukacs,  the  factors  are  per¬ 
mitted  to  have  arbitrary  support. 

The  results  above  can  be  combined  into  the  following. 

Theorem  2 .  The  uniform  distribution  on  a  compact  Polish  group  G  is  semi- 
decomposable  unless  G  is  or  . 

Proof ,  For  finite  groups,  Lemma  2  and  the  subgroup  algorithm  prove  the  claim, 
since  a  finite  group  with  no  proper  subgroups  is  the  residues  of  a  prime.  We 
now  argue  that  every  infinite  compact  group  contains  a  closed  non-trivial  sub¬ 
group.  A  topological  group  has  no  small  subgroups  (NSS)  if  there  exists  a 
neighborhood  U  of  the  identity  such  that  the  only  subgroup  in  U  is  {id}. 
Clearly,  a  group  which  has  small  subgroups  contains  non-trivial  closed  sub¬ 
groups.  A  famous  theorem  of  Gleason  (1952)  implies  that  a  group  with  NSS  is 
a  Lie  group.  The  structure  of  compact  Lie  groups  is  well  known;  see,  for  ex¬ 
ample,  Chapter  11  of  Pontryagin  (1966):  If  G  is  Abelian,  then  the  connected 
component  of  the  identity  is  a  finite  dimensional  torus  which  certainly  has 
non-trivial  closed  subgroups,  hence  G  does.  If  G  is  not  Abelian,  then 
its  maximal  torus  is  a  non-trivial  closed  subgroup.  Q 

4.  Restricted  Factorizations 

The  results  in  Section  3  show  that  in  most  cases,  the  uniform  distribu¬ 
tion  can  be  factored.  In  this  section  the  factors  are  restricted  in  some  way. 
The  first  result  introduces  material  on  representations  which  will  be  used 
throughout  this  section. 


Difference  distributions. 


Let  P  be  a  probability  on  a  group.  Let  ?(A)  =  P(A-1) .  Example  2  of 
Section  2  gives  some  motivation  for  considering  factorizations  of  the  form 
P*?. 

Theorem  3 .  On  a  compact  group,  if  U  *  P  *  P  ,  then  P  =  U. 

Proof .  The  argument  uses  the  basic  facts  about  representations  of  compact 
groups.  See  Chapter  4  of  Serre  (1977)  for  a  review.  We  need  the  following 

facts.  A  representation  p  is  a  continuous  map  from  G  into  GL^  (V)  for  a 

P 

complex  vector  space  V  of  dimension  dp.  The  matrices  p(g)  can  all  be 
chosen  as  unitary:  p(g)  p(g)*  ®  id  ,  where  the  *  means  conjugate  transpose. 

A  representation  is  irreducible  if  the  matrices  {p(g)}^G  leave  no  non-trivial 
subspace  of  V  invariant.  The  transform  of  a  probability  P  on  G  at  repre¬ 
sentation  p  is  P(P)*/G  P(g)  P(dg).  The  matrices  {p(P)}p  G  ,  as  p  ranges 
over  the  finite  dimensional  irreducible  representations,  determine  P.  If  p 
is  non-trivial,  p(U) *  0.  With  these  facts,  the  proof  is  easy.  For  a  unitary 
representation  p(g  )  *  p(g)  =  p(g)  •  Thus  p(P)  =  p(P)  .  If  P*P=U  ,  then 
0*  p(U)  *  p(P*P)  =  p(P)  P(P)=P(P)  p(P)*.  Thus  p(P)  =  0  for  all  non-trivial 
irreducible  representations,  so  P  =  U.  HI 

Square  roots. 

On  a  compact  Abelian  group  the  factorization  U  =  P*P  is  impossible 

unless  P  is  uniform.  This  follows  because  all  irreducible  representations 

2 

are  one-dimensional  and  0  *  p(U)  *  p(P *  P)  *  p(P)  implies  p(P)  *  0.  For 
non -Abelian  groups,  things  are  more  complex. 

Example  3 .  On  a  square  root  P  of  U  can  be  defined  as  follows: 

using  cycle  notation  for  permutations  let 


P(id)  -  P(12)  -  P(23)  -  i+h,  P(31)  -  |-h,  P(123)  -  |-h,  P(132)  -  i+h  , 

for  any  h  with  0  £  h  <  |. 

To  motivate  Theorem  4,  let  us  explain  how  this  example  was  found.  We 

2 

seek  a  probability  P  on  such  that  p(P)  ■  0  for  each  non-trivial 

irreducible  representation  p.  Let  us  find  a  function  f  on  such  that 

p(f)  **  0  for  all  irreducible  p  and  then  P(tt)  *  -g-  +  ef(TT)  ,  with  e  chosen 

small  enough  that  P(it)  ^  0  will  do  the  job.  There  are  three  irreducible 

representations  of  ,  the  trivial  representation  pt  ,  the  alternating 

representation  p  ,  and  a  two-dimensional  representation  p0 .  If  p0(f) 
a  l  z. 

(o  *\ 

is  a  non-zero  nilpotent  matrix  P2(f)  *  Iq  q)  and  Pt(f)  =  Pa(f)  *  0  , 
then  p  (f)  =  0,  This  gives  five  linear  relations  for  the  six  numbers  f (tt)  . 

The  example  above  resulted  from  solving  these  equations.  The  following 
theorem  gives  a  generalization. 

Theorem  4.  Let  G  be  a  compact,  non-commutative  group.  The  following 
conditions  are  equivalent. 

a)  There  is  a  probability  measure  P  4  U  such  that  P  *  P  =*  U. 

b)  There  is  an  irreducible  representation  p  of  G  such  that  the 

algebra  IR  p(g)}  contains  nilpotent  elements. 

Remark.  The  quaternions  ±1,  ±i,  ±j,  ±k  form  a  finite  non-commutative 
group  such  that  the  uniform  distribution  does  not  have  a  non-trivial  square 
root.  This  follows  from  Theorem  5  below  which  identifies  all  finite  groups 
satisfying  condition  b) . 

The  proof  of  Theorem  4  requires  some  notation.  Throughout  we  assume 
that  all  Irreducible  representations  are  given  by  unitary  matrices.  If  p 
is  a  representation,  let  p(g)  be  defined  as  p(g)  *  p(g~*)f.  The  follow¬ 
ing  lemma  is  used  in  the  proof  of  Theorem  4. 
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Lemma  3.  Let  y  be  a  bounded  measure  on  a  compact  group  G.  Then  y 
is  real  if  and  only  if  p(y)  *  p(y)  for  every  irredicuble  p. 

Proof,  If  y  is  real,  then 


Vu) 


PjjCg)  p(dg)  =  P1;.(y)  • 


Conversely,  suppose  y  Is  a  measure  such  that  p(y)  =  p(y) .  This  means 


0  = 


PjjCg)  W(dg)  - 


Pij(g)  P(dg) 


or 


-l 


Pij  <8> 


ii(dg) 


P^Cg)  P(dg)  . 


Since  this  holds  for  every  irreducible  p  ,  the  Peter -Weyl  theorem  implies 
that  the  set  function  y  -  y  is  zero,  so  y  is  real.  ]~~] 

2 

Proof  of  Theorem  4.  If  U  =  P*P  ,  then  p(P)  *  0  and  p(P)  +  0  for  some 

* 

p  because  P  +  U.  Thus  R^  has  nilpotents.  Conversely,  let  y^  e  R^ 

be  nxlpotent.  If  y^  *  0  and  n  is  the  smallest  such  power,  then  set 

y  *  y^  This  is  non-zero  and  y^  =  0.  Define  a  continuous  function  f  on 

G  as  follows.  Set  p(f)  »  0  if  p  4  p*  or  p*,  p*(f)  »  y  ,  and  if  p*  is 

not  equivalent  to  p,  p*(f)  =  y.  This  defines  a  non-zero  continuous  function 

f  through  the  Peter-Weyl  theorem.  Because  of  Lemma  3,  f  is  real.  Clearly, 
2 

p(f)  *  0  for  all  irreducible  p.  It  follows  that  for  e  suitably  small, 

P  ■  (l+ef(g))dg  is  a  probability  satisfying  P*P  **  U.  Q 

A  sufficient  condition  for  Theorem  4  is  that  G  have  a  real  repre¬ 
sentation  of  dimension  2  or  greater:  If  p*  is  an  n-dimensional  real  re¬ 
presentation,  let  f(g)  *  C  p*n(g).  Then  by  the  Schur  orthogonality  relations 
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for  any  P  4  P*,  P(f)  *  0.  Also,  Schur's  relations  lnply  P*(f)  is  an  nxn 
matrix  which  is  zero  except  that  the  lf  n  entry  is  e  /  dg  >  0,  Thus 

p*(f)2  «  0.  Let  a  probability  P  be  defined  by  P*  (l*fcf)dg  ,  with  e  cho¬ 
sen  so  that  P  is  positive.  Then  p(P*P)  *  p(U)  for  all  irreducible  repre¬ 
sentations.  As  an  example,  the  adjoint  representation  of  a  compact  simple  lie 

group  has  a  basis  with  respect  to  which  it  is  real  orthogonal.  Thus,  the  group 
S0(n)  of  proper  notations  for  n-3  ,  and  n  _>  5  admits  a  square  root  of  U. 

The  next  result  classifies  all  finite  groups  such  that  the  uniform  dis¬ 
tribution  is  semi-decomposable. 

Theorem  5.  The  uniform  distribution  on  a  finite  group  G  is  semi-decom¬ 
posable  if  and  only  if  G  is  not  Abelian  or  the  product  of  the  quarternions 
and  a  finite  number  of  two-element  groups. 

Proof .  It  was  argued  above  that  Abelian  groups  do  not  admit  a  non-trivial 
square-root  of  the  uniform  distribution.  In  light  of  Theorem  4,  the  non-Abelian 
groups  with  the  property  that  R^(G)  has  no  nilpotents  must  be  classified.  We 
will  use  a  lemma  of  Sehgal  (1975).  Some  notation  is  needed.  Let  <)  denote 
the  rational  numbers,  and  let  l}(G)  ,  the  rational  group  ring  denote  the  set 
of  formal  linear  combinations  of  elements  of  G  with  rational  coefficients. 

A  non-Abelian  group  in  which  every  subgroup  is  normal  is  called  Hamilton ian. 
Theorem  12.5.4  of  Hall  (1959)  shows  that  every  Hamiltonian  group  is  of  the 
form  G  «  A  x  B  x  H  ,  where  A  is  an  Abelian  group  of  odd  order,  B  is  a 
product  of  a  finite  number  of  two-element  groups,  and  H  is  the  eight  element 
group  of  quarternions  {±1,  ±i,  ±j,  ±k}.  The  following  lemma  has  been  ab¬ 
stracted  from  Sehgal  (1975).  The  result  also  appears  in  Pascaud  (1973). 

Lemma  3  (Sehgal).  If  Q(G)  has  no  nilpotents,  then  G  is  Hamiltonian. 
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r 


Proof.  Observe  first  that  if  R  is  any  ring  with  unit  and  no  nilpotents. 


then  an  idempotent  e  »  e  in  R  commutes  with  every  element  of  R.  In- 

2 

deed,  the  equation  0  *  [er(l-e)]  implies  er(l-e)  »  0  ,  so  er  =  ere. 
Similarly,  re  *  ere  *  er.  Now  let  R  =»  Q(G)  ,  let  H  be  a  subgroup  of  G  , 
and  set  e*  ygy  h*  fol^-ows  that  for  eac^  g  e  G,  geg-1  *  e  and  this 


Implies  that  for  each  h  c  H,  ghg  ^  e  H  ,  so  H  is  normal, 


□ 


Proof  of  Theorem  5.  map  Q(G)  into  Rp(G)  by  mapping  g  p(g)  and  extend¬ 
ing  by  linearity.  This  is  an  algebra  homomorphism.  We  thus  get  a  map  from 
fJ(G)  into  np  Rp(G).  From  Proposition  10  of  Serre  (1977)  this  map  is  1-1. 
Since  no  Rp(G)  has  nilpotents,  neither  does  Q(G) .  Lemma  3  implies  that  G 
has  the  form  G  -  A  *  B  *  H  where  A  is  an  Abelian  group  of  odd  order.  If 
A  is  not  zero,  choose  a  character  X  taking  at  least  one  complex  value.  Let 


p  be  the  irredicuble  representation  of  H  which  sends  iM- 


i  » 


[?  t 


f°  ‘i 

L  °J 


and 


Then  \  ®  p  is  an  irreducible  two-dimensional  representation 


so  R  (G)  consists  of  all  2  *  2  matrices,  with  complex  entries,  and  so 

A.  ** 

contains  nilpotents.  Q 


Spherical  functions. 

As  Theorem  5  shows,  the  uniform  distribution  on  many  non-commutat ive 
groups  has  non-trivial  square  roots.  We  now  show  that  if  some  symmetry  re¬ 
strictions  are  put  on  the  factors,  then  roots  do  not  exist.  Let  G  be  a 
compact  group  and  H  a  closed  subgroup.  A  probability  p  on  G  is  spherical 
if  vKhjAt^)  *  p(A)  for  all  measurable  A  in  G  and  h^,  h2  c  H.  Spherical 
functions  are  the  object  of  intensive  study  in  modern  harmonic  analysis.  See 
Dieudone  (1978)  for  an  extensive  treatment.  The  pair  (G,H)  is  called  a 
Gelfand  pair  if  the  algebra  of  ^  spherical  functions  is  commutative.  Many 
examples  of  Gelfand  pairs  in  probability  are  discussed  in  a  very  useful  survey 
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by  Letac  (1981) .  Compact  lie  groups  G  and  their  maximal  compact  subgroups 
H  form  Gelfand  pairs,  so  the  proper  rotations  S0(3)  and  the  rotations  fix¬ 
ing  a  point  are  a  concrete  example.  The  group  G  ■  and  subgroup  S  ^ 
form  a  Gelfand  pair.  There  is  a  well  developed  transform  theory  for  Gelfand 
pairs  based  on  zonal-spherical  functions.  These  are  spherical  functions  s 
on  G  with  the  property  that 

|  8(g)  ^  *  w2(dg)  -  |  s(g)  lij^dg)  |  s(g)  u2(dg)  , 

for  any  spherical  probabilities  The  theory  shows  that  there  are  enough 

spherical  functions  to  determine  a  measure  and  defines  p(s)  *  /  s(g)  y(dg)  . 
It  now  follows  as  in  the  Abelian  case  that  the  uniform  distribution  on  a 
compact  group  G  does  not  adroit  non-trivial  square  roots  which  are  spherical 
with  respect  to  a  subgroup  H  such  that  (G,H)  form  a  Gelfand  pair. 
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